Journal: Cell Reports Methods
Article Title: A statistical approach for systematic identification of transition cells from scRNA-seq data
doi: 10.1016/j.crmeth.2024.100913
Figure Lengend Snippet: Identifying transition cells based on gene pairwise Pearson’s correlation coefficients (A) Stable cells and transition cells are separated through their intrinsic gene pairwise Pearson’s correlation coefficients (GPPCCs). In a Waddington’s landscape illustrating developmental processes, there are “valleys” and “ridges.” Valleys correspond to stable cellular states, and ridges represent barriers separating these stable states. During developmental processes, cells may transit from one stable state to another due to the change in the local landscape. We modeled these transitions as a result of the change in gene regulatory relations using stochastic differential equations (SDEs). Based on our mathematical derivations, gene pairwise correlation coefficients for transition cells are closer to ±1 compared with stable cells (illustrative heatmap: x axis, cells; y axis, gene pairs; color, values of GPPCCs) (see ). We further defined a transition index, which is proportional to the transition probability, to identify transition cells. (B) Transition cells identification workflow. To identify transition cells, we developed an analytical workflow containing several steps. We first did data preprocessing, including quality control, finding neighbors of each cell and obtaining the gene list with the largest expression variations. Then, GPPCCs were calculated for each cell by using the expression profiles of the cell and its nearest neighbors. Based on the empirical distribution of coefficients from all cells, a transition index, which is proportional to the transitioning probability, was calculated for each cell. (C)–(F) Identifying transition index using a simulation dataset. The simulation dataset is generated using SERGIO containing three steady states with linear transitioning structure. There are 5,000 stable cells in each steady state and 1,000 transition cells transitioning from state 1 to state 2 and state 2 to state 3. (C) UMAP of all cells with transition cells highlighted in red. (D) UMAP colored by transition index. (E and F) Evaluation with doublets. A total of 1,000 stable cells from state 1 and state 2 are randomly selected to generate doublets. (E) UMAP colored by the state of cells. (F) UMAP colored by transition index.
Article Snippet: This is because the strengths and connections of regulatory networks can change during cellular state transitions, , while CellTran requests gene pairwise Pearson’s correlation coefficients (GPPCCs) to be calculated from cells that have similar regulatory profiles.
Techniques: Control, Expressing, Generated